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ABSTRACT 


This final report presents our latent research activity In voioe 
compression. *■ Ve have designed a non-real ties simulation system that Is 
Implemented around an IBM-PC where the IBM-PC Is used as a speech work station 
for data acquisition and analysis of voice samples. A real-time Implementa- 
tion la also proposed. This real-time Voioe Compression Board (VCB) Is archi- 
tected around Texas Instruments TKS-3220. 

The voice compression algorithm Investigated here was described In an 
earlier report titled. "Low Cost Voioe Compression for Mobile Digital Radios." 
by the author. Ve will assume the reader Is familiar with the voioe compres- 
sion algorithm discussed In this report. The VCB compresses speech waveforms 
at data rates ranging from 4.1 E bps to Id K bps. This board Interfaces to 
the IBM-PC S-blt bus. and plugs Into a single expansion slot on the mother 
board. 


SUMMARY OF RESEARCH 


!• fiPMfih Mark station 

To provide an Inexpensive speech work station, we selected the IBM-PC. 
With the large pool of software/hardware available for this personal oomputer, 
a oospletely aelf-oontalned speech work station was Integrated. 

Data acquisition hardware neoeasary for the speech work station sust be 
Cosposed of the following modules: 

.12-blt A/D module with minimus l :0 microsecond conversion tine 
.12-blt A/D nodule with nlnlnun 20 microsecond settling tine 

The following features are required: 

.Progrnanable Input gain: ranging 2.4.8 
.Single Differential Input channel 
.Programmable sampling rate 
.Fully compatible with IBM Personal Computer 

We decided to use the Data Translation (Marlborough. Massachusetts) DT- 
2801 A, a single board analog and digital I/O system for the IBM-PC. This 
board oontalns all of the above modules and has 16 single-ended or 8 differen- 
tial A/D Input channels. . 

Software drivers for this board are available from numerous vendors. We 
decided to use the Interactive Laboratory System (ILS) software package for 
the IBM-PC. This software package Is developed by the Signal Technology Corp. 
(Goleta, California), ILS performs numerous functions In a variety of areas 
Including: signal display, spectral analysis, display and editing functions. 

An analog Interface circuitry was designed to Interface a microphone to 
a single differential Input channel for the A/D operation, and a speaker to a 
single D/A channel. 

Voloe compression experiments are performed by first sampling the Input 
speech waveform. The speech samples are stored on the hard disk drive. A 
program was written to translate the ILS sampled data files to standard ascii 
character files. This file Is then transferred via the serial RS-232 port on 
the IBM-PC to the Vax 11/750. Another program oonverts this ascii file to a 
binary packed format. The voice compression program reads this Input file, 
and creates an output file which contains the aynthesized compressed speech 
samples. This file Is translated from the binary packed format to ascii for- 
mat, and then transferred to the IBM-PC. This ascii file Is translated Into 
an ILS sample data file. The oompressed speech may now be played on the 
speaker by performing D/A operation on this file. 


In Fig. (1) we have shown this development environment. 

We shall now discuss the experimental results that we have obtained by 
using this development station. 


I • Jte gS ClBfcifln Si Experiments 

In the course of our experiments, we evaluated several different varia- 
tions of the voloe compression algorithm, namely, 

1. Filter Coefficient Smoothing 

2. Automatic Gain Control 

3. Fixed Point Implementation 

4 . Parmneter Quantization 

Filter Coefficient Smoothing: In this version, the filter coefficients 

during each sampling period are smoothed by linear interpolation. This is 
done for extension filtering of the tree search algorithm, and the synthesis 
filter. 

Automatic Gain Control: In this version, a gain factor Is used to scale 

consecutive samples of the same value, l.e., 

„ “i- i; If al - *1-1 

«i » 

1 ; otherwise 

This gain Is used In both extension filtering and synthesis filter. 

Fixed Point Implementation: To Implement the voice compression algo- 

rithm on a digital signal processor. It is necessary to perform all the arith- 
metic In the field of real Integers modulo 2 1 . 

Parameter Quantization: To transmit the filter coefficients over the 
communication channel, It Is necessary to quantize the filter coefficients 
(Total of 8) using a parameter quantization schmne as described In the previ- 
ous report. 

In the enclosed appendix we have Included a set of experiments that we 
have performed. In Experiments 101 through 107, the speech waveforms for the 
duration of each Utterance Is displayed. Furthermore, the spectrum and the 
waveform for different segments of each utterance is also depicted. 


1. Suwary Si Sue Experiments 

Filter smoothing does smooth the oom pressed voloe signal. However, It 
degrades the Intelligibility of high frequency, low amplitude (ex. nasal con- 
sonants) signals. 


The oompreaaed voioe Is extremely aonsl tlve to the constant gain coeffi- 
cient a. A gain value of 1.2 resulted In the beat perforaance. The vowels 
were allghtly diatorted uaing thia technique. 

The results of the integer laplenentation of the voice compression algo-’ 
ritha were quite exciting. Ve found no loaa in perforaance when 16-bit reao- 
lution with proper acallng waa used. 

Parameter quantization reaulted in negligible loaa of perforaanoe. 

Baaed on our observations and the reaulta of our experiments, one aay 
conclude that from the wavefora tracking viewpoint of the tree eearoh algo- 
rithm, thia algorithm performa quite well. The reconstructed apeech wavefora 
cloaely tracks the original apeech wavefora. There is. however, acme tracking 
error for high frequency components (3 Khz and above). Froa the epeotrua Of 
the reconatructed apeech, it ia clear that the spectrum of the synthesized 
apeech and original apeech reaeable each other. It la, however, difficult to 
extract the signal- to- noise ratio directly from the spectrum. 1 

From the intelligibility viewpoint, this voice eoapreaaion algorithm 
perforaa quite well. And given our Integer Implementation of this algorithm, 
we have ahown the viability of building a VCP using a digital algnal processor 
with lfi-bits of resolution. 


[1] It would be more appropriate to compute spectrogram for these 
waveforms, rather than the three-dimensional spectrum. Unfortunately, we did 
not have this facility. 


4 . 


JtaAl-UnS SsiSS Compression Board 



| The primary candidates for the real-time process or were Fujitsu MB-8764 

\ and Texas Instruaents TK5-32020. MB- 87 64 was rejected, based on a comparison 

of tho Instruction set and the available support for this processor. 

' * 

1 In Fig. (2) we show Uie architecture for a real-time voice compression 

] processor. A standard IBM^PC wire wrap board, which plugs into the XBM-PC bus 

I Is used to build the prototype for the VCB. 

• At initialization time All the programs are down-loaded via the IBM- PC 

! bus Into the global two- port memory. Each processor then ooples the proper 

segment of each task Into Its local RAM. 

i 

Based on the computational complexity of the voioe compression algo- 
ij rlthm. It has been determined that to implement the voioe compression algo- 

| rlthm In real-time. It requires at least two TM5-32020 to execute the algo- 

| rlthm. For this reason the algorithm Is broken down In two segnents. The 

analysis filter processor performs the wlthenlng operation and outputs the 
I residue sequence. And the filter coefficients are oomputed. The aeoond pro- 

I cessor performs all the tree filtering algorithm and the reconstruction 

filter. 

] All task synchronizations are performed via mailboxes resident in the 

global memory. The second processor lags behind the first processor by a full 
frame cycle (10 msec). The operation of the two processors is completely 
overlapped. 

All of the results from the analysis filter are oopled from the local 
RAM to the global RAM. The aeoond processor reads this block of data from the 
global RAM and copies it Into its local RAM. The output of the aeoond proces- 
sor Is binary-packed data, and quantized parameters. These results are acces- 
sible from the PC-Bus. The host processor (Intel 8088) may access these 
results from the PC-Bus and transmit them through the serial RS-232C port, 
which can be transmitted via a modem to the circuit- switched telephone lines. 

The serial bus on the first TKS-32020 is used for A/D operation, and the 
serial bus on the second processor Is used for the D/A operation. This off- 
loads the processor memory bandwidth. 

i 

The real-time Implementation of the voioe compression algorithm was out- 
| lined. Some experimental results of the variation of the voice compression 

algorithm were stated. These results showed the tracking performance of the 
voice compression algorithm. 

I 

I This report oonflrms the feasibility of building a VCB using two TMS- 

| 32020. This board may be used for low-cost digital mobile voice terminal 

| which can interface to the telephone line or other transmission medltms. 
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